Model-Based Reinforcement Learning via Stochastic Hybrid Models

Authors

Abstract

Optimal control of general nonlinear systems is a central challenge in automation. Enabled by powerful function approximators, data-driven approaches to control have recently successfully tackled challenging applications. However, such methods often obscure the structure of dynamics and control behind black-box over-parameterized representations, thus limiting our ability to understand closed-loop behavior. This article adopts a hybrid-system view of nonlinear modeling and control that lends an explicit hierarchical structure to the problem and breaks down complex dynamics into simpler localized units. We consider a sequence modeling paradigm that captures the temporal structure of the data and derive an expectation-maximization (EM) algorithm that automatically decomposes nonlinear dynamics into stochastic piecewise affine models with nonlinear transition boundaries. Furthermore, we show that these time-series models naturally admit a closed-loop extension that we use to extract local polynomial feedback controllers from nonlinear experts via behavioral cloning. Finally, we introduce a novel hybrid relative entropy policy search (Hb-REPS) technique that incorporates the hierarchical nature of hybrid models and optimizes a set of time-invariant piecewise feedback controllers derived from a piecewise polynomial approximation of a global state-value function.
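As a rough illustration of what a stochastic piecewise affine model with state-dependent switching can look like, the following minimal sketch simulates a scalar two-mode system. It is not the authors' model or their EM procedure. The mode parameters, the noise level, the logistic switching rule, and all function names (`switch_probability`, `step`, `simulate`) are assumptions chosen only for the example.

```python
import math
import random

# Hypothetical two-mode system with a scalar state (illustrative values only).
# In mode k the dynamics are affine: x_next = a[k] * x + b[k] + Gaussian noise.
MODES = {
    0: {"a": 0.9, "b": 0.5},   # noise-free fixed point at x = 5
    1: {"a": 0.9, "b": -0.5},  # noise-free fixed point at x = -5
}
NOISE_STD = 0.05


def switch_probability(x, mode):
    """Probability of leaving `mode` when the state is x.

    A logistic function of a quadratic feature of x, so the switching
    boundary is nonlinear in the state (an assumed, illustrative form).
    """
    signed = x if mode == 0 else -x
    feature = signed * abs(x) - 9.0           # probability is 0.5 at |x| = 3
    feature = max(-50.0, min(50.0, feature))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-feature))


def step(x, mode, rng):
    """Sample the next (state, mode) pair."""
    if rng.random() < switch_probability(x, mode):
        mode = 1 - mode
    params = MODES[mode]
    x_next = params["a"] * x + params["b"] + rng.gauss(0.0, NOISE_STD)
    return x_next, mode


def simulate(x0, mode0, horizon, seed=0):
    """Roll the hybrid system forward and return state and mode sequences."""
    rng = random.Random(seed)
    xs, zs = [x0], [mode0]
    x, z = x0, mode0
    for _ in range(horizon):
        x, z = step(x, z, rng)
        xs.append(x)
        zs.append(z)
    return xs, zs


if __name__ == "__main__":
    states, modes = simulate(x0=0.0, mode0=0, horizon=200)
    print("mode switches:", sum(m1 != m0 for m0, m1 in zip(modes, modes[1:])))
```

In this toy setting the mode sequence is observed. In the setting the abstract describes, the modes would be latent, and an EM-style algorithm would infer them together with the affine parameters and the transition boundaries.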

Similar resources

Transferring Models in Hybrid Reinforcement Learning Agents

The main objective of transfer learning is to reuse knowledge acquired in a previous learned task, in order to enhance the learning procedure in a new and more complex task. Transfer learning comprises a suitable solution for speeding up the learning procedure in Reinforcement Learning tasks. In this work, we propose a novel method for transferring models to a hybrid reinforcement learning agen...

Reinforcement Learning: Model-based

Reinforcement learning (RL) refers to a wide range of different learning algorithms for improving a behavioral policy on the basis of numerical reward signals that serve as feedback. In its basic form, reinforcement learning bears striking resemblance to ‘operant conditioning’ in psychology and animal learning: actions that are rewarded tend to occur more frequently; actions that are punished ar...

Model-Based Reinforcement Learning

Reinforcement Learning (RL) refers to learning to behave optimally in a stochastic environment by taking actions and receiving rewards [1]. The environment is assumed Markovian in that there is a fixed probability of the next state given the current state and the agent’s action. The agent also receives an immediate reward based on the current state and the action. Models of the next-state distr...

Reinforcement Learning using Kernel-Based Stochastic Factorization

Kernel-based reinforcement-learning (KBRL) is a method for learning a decision policy from a set of sample transitions which stands out for its strong theoretical guarantees. However, the size of the approximator grows with the number of transitions, which makes the approach impractical for large problems. In this paper we introduce a novel algorithm to improve the scalability of KBRL. We resor...

Model-Based Probabilistic Pursuit via Inverse Reinforcement Learning

In this paper we address the integrated prediction, planning, and control problem that enables a single follower robot (the “photographer”) to quickly re-establish visual contact with a moving target (the “subject”) that has escaped the follower’s field of view. Our work addresses this unavoidable scenario, which reactive controllers are typically ill-equipped to handle, by making intelligent p...

Journal

Journal title: IEEE Open Journal of Control Systems

Year: 2023

ISSN: 2694-085X

DOI: https://doi.org/10.1109/ojcsys.2023.3277308